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Abstract: Development of a vaccine against congenital infection with human 
cytomegalovirus is complicated by the issue of re-infection, with subsequent vertical 
transmission, in women with pre-conception immunity to the virus. The study of 
experimental therapeutic prevention of re-infection would ideally be undertaken in a small 
animal model, such as the guinea pig cytomegalovirus (GPCMV) model, prior to human 
clinical trials. However, the ability to model re-infection in the GPCMV model has been 
limited by availability of only one strain of virus, the 22122 strain, isolated in 1957. In this 
report, we describe the isolation of a new GPCMV strain, the CIDMTR strain. This strain 
demonstrated morphological characteristics of a typical Herpesvirinae by electron 
microscopy. Illumina and PacBio sequencing demonstrated a genome of 232,778 nt. Novel 
open reading frames ORFs not found in reference strain 22122 included an additional 
MHC Class I homolog near the right genome terminus. The CIDMTR strain was capable 
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of dissemination in immune compromised guinea pigs, and was found to be capable of 
congenital transmission in GPCMV-immune dams previously infected with salivary 
gland-adapted strain 22122 virus. The availability of a new GPCMV strain should facilitate 
study of re-infection in this small animal model. 

Keywords: guinea pig cytomegalovirus; cytomegalovirus strain variation; CMV immime 
evasion; congenital cytomegalovirus infection; congenital CMV vaccines 



1. Introduction 

Development of a vaccine against human cytomegalovirus (HCMV) is a major public health 
priority [1]. The suggestion that passively transferred antibody protects the fetus against infection and 
injury [2] has driven efforts to develop recombinant subunit vaccines targeting major envelope 
glycoproteins, such as glycoprotein B (gB) [3,4]. Although clinical trials of recombinant gB vaccines 
have shown some degree of effectiveness in preventing HCMV infection and disease in high risk 
populations [5,6], vaccine mediated protection with vaccines targeting this single envelope 
glycoprotein appears to be incomplete. Moreover, the effectiveness of natural immunity in preventing 
congenital HCMV infection and its attendant sequelae is itself a matter of some controversy. A number 
of recent studies have described fetal HCMV transmission in women with preconception immunity, 
due to re-infection with new strains of HCMV [7-12]. Such infections can produce sequelae identical 
to those observed in congenitally infected infants born to women with primary HCMV infection in 
pregnancy [13,14]. These observations certainly complicate HCMV vaccine design, and suggest that: 
(1) for full protection, an HCMV vaccine may need to elicit responses superior to those conferred by 
natural immunity; (2) there may be a strong rationale for vaccinating women of childbearing age who 
are already HCMV seropositive, in addition to targeting and immunizing seronegative women, toward 
the goal of preventing re-infection with subsequent transmission of the "new" strain. 

Several clinical studies have documented the phenomena of re-infection in women of childbearing 
age. In one prospective study performed at the University of Alabama-Birmingham, serum specimens 
from 46 women with preconceptional immunity against HCMV obtained during a previous pregnancy 
and a new pregnancy were analyzed for antibodies against the strain-specific epitopes of HCMV 
glycoprotein H (gH), and the nucleotide sequences of the gH gene from seven HCMV isolates were 
determined. Ten of the 16 mothers with infected children (62%) acquired new antibody specificities 
against gH, as compared with only 4 of the 30 mothers of uninfected infants (13%), suggesting that 
acquisition of an infection with a virus expressing a novel sfrain-specific gH genotype during 
pregnancy was associated with congenital fransmission [9]. In another study in Brazil that followed 
7,848 women prospectively, sera from 40 mothers of congenitally infected infants and 109 mothers of 
uninfected control newborns were analyzed for strain-specific anti-HCMV antibodies, based not only 
on polymorphisms within gH binding sites, but also a second antibody reactivity site on gB [14,15]. 
Seven of 40 (17.5%) study women, but only 5 of 109 (4.6%) controls acquired antibodies reactive with 
new HCMV strains during pregnancy (p = 0.002), suggesting that maternal reinfection by new strains 
of HCMV is a major source of congenital infection in this population. In a study of re-infection (based 
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on acquisition of new gB and/or gH antibody specificities) in 205 seropositive women performed by 
Ross and colleagues at UAB, approximately one-third of the study participants (59 of 205) were noted 
to have reinfection, using this definition, during follow-up [10]. The molecular and immunological 
correlates of re-infection are unclear. There is some evidence that gB polymorphisms in clinical 
isolates may be less important for re-infection than polymorphisms in gH and other envelope 
glycoproteins. In a study in Brazil, infections in immunocompetent women with strains corresponding 
to more than one gB genotype were not common [16]. Additionally, in a study of the HCMV strains 
acquired longitudinally in women who developed infection in spite of being enrolled in the 
recombinant gB vaccine trial at UAB [5], there was no selection for or against any non-vaccine gB 
subtype, in spite of women being immunized only with gB protein corresponding to the Towne (gBl 
subtype) strain [17]. Other evidence suggests that the response (or lack thereof) to the envelope 
glycoprotein N (gN) may play a role in predisposing to re-infection with new HCMV strains 
expressing heterologous gB and/or gH genotypes [18-20]. 

Irrespective of the mechanism(s) involved, the issue of re-infection is a major challenge in vaccine 
design. There is increasing evidence that congenital HCMV infections after nonprimary maternal 
infections can lead to symptomatic disease and substantial long-term sequelae. Notably, recent evidence 
fi-om a study at UAB indicated that the incidence of hearing loss in infants infected after nonprimary 
maternal infection was similar to that in infected infants bom to women with primary infection [13], 
although in this study, infants in the primary infection group were more likely to demonstrate 
progressive and severe or profound hearing loss, compared to infants in the non-primary group. 

Since the consequences of re-infection and congenital transmission with a newly acquired strain in 
pregnant seropositive women can be similar to those that occur after primary infection and 
transmission in CMV-seronegative women [10,11,13,14], the study of re-infection in a small animal 
model of congenital transmission would be very useful for the modeling of vaccine strategies to 
prevent maternal re-infection [21,22]. Ideally, HCMV reinfection would be studied in an animal model 
prior to clinical vaccine trials. Unfortunately, the strict species-specificity of cytomegaloviruses 
precludes preclinical testing of HCMV vaccines in animals. However, a number of rodent and primate 
cytomegaloviruses are useful in modeling HCMV vaccines and therapies, given the conservation of 
many immunogenic structural proteins amongst the various viruses [23-25]. 

Among the small animal models, the guinea pig cytomegalovirus (GPCMV) is uniquely useful, 
since, in contrast to other rodent models, transplacental infection of the fetus occurs following viral 
challenge during pregnancy [24,26]. Hence, the GPCMV model is particularly well-suited to the study 
of vaccines against congenital infection. However, until now this model has relied exclusively on a 
single strain of GPCMV, 22122, isolated by Hartley in 1957 [27]. While it has been feasible to study 
re-infection by experimentally inoculating strain 22122 virus into naturally seropositive guinea 
pigs [28], the lack of defined genetic information on other GPCMV strains (e.g., those presumably 
latent in the seropositive animals used in the above mentioned studies) has made it impossible to study 
of the role of strain variation in fetal infection in the setting of preconception immunity. 

In this report, we describe the isolation of a novel strain of GPCMV, the CIDMTR strain. This 
communication represents the first report of detailed characterization of a GPCMV isolate since the 
original publication of isolation of the 22122 strain [27]. Although sequence and ORF structure were 
generally well conserved with the 22122 strain of GPCMV, the CIDMTR strain demonstrated some 
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differences in genome structure, particularly in the right-hand end of the viral genome. There are also 
substantive differences in some protein coding sequences between the two strains, including sequences 
in envelope glycoproteins, suggesting that these proteins may have been the targets of immune 
selection during the evolution of GPCMV in the guinea pig host. We describe in this report the 
morphology and DNA sequence of this newly isolated strain, and report preliminary experiments 
regarding its pathogenesis in vivo. The availability of a second strain of GPCMV should enable the 
study of re-infection and, potentially, the development of vaccine strategies designed to protect against 
maternal re-infection in the guinea pig model of congenital cytomegalovirus infection. 

2. Results and Discussion 

2.1. Isolation of the CIDMTR Strain 

In the course of ongoing vaccine and pathogenesis studies, guinea pigs were screened at the time of 
purchase for GPCMV antibodies, using an ELISA based on GPCMV strain 22122 [29]. Within a group 
of 24 guinea pigs purchased from a commercial source, 5 (21%) were found to be GPCMV-seropositive 
by ELISA. Western blot analysis was performed using sera from commercially purchased, 
ELISA-positive animals using purified strain 22122 virions as the source of target antigens (Figure 1). 
These studies confirmed that sera from these animals were broadly cross-reactive with GPCMV 
strain 22122 virion-associated polypeptides, suggesting that these animals were infected with GPCMV 
strains that were closely related antigenically to strain 22122. 

Figure 1. Western blot analysis of sera from three "naturally seropositive" commercially 
purchased outbred Hartley guinea pigs, (a) Western analysis using a pooled, high-titer 
polyclonal anti-GPCMV antisera from guinea pigs immunized with adjuvanted viral 
particles [30] or sera from three "natural seropositive" guinea pigs (GPl, GP2, GP3) 
obtained from a commercial supplier, using sucrose gradient-purified virions from 22122 
strain as target antigen. Two independently derived, monospecific anti-GPCMV gB 
antibodies (moab 29-29 [31] (gBl) and moab IE321 (gB2)) are included as controls, 
(b) Photomicrographs comparing CPE of low-passage (P2) CIDMTR virus (right panel) to 
GPCMV strain 22122 (left panel) following infection of GPL cells. CIDMTR plaques are 
more rounded, with elliptical appearance, compared to the more spindle-like morphology 
of 22122-infected fibroblasts. Plaques photographed at 60x magnification. 
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One of the seropositive animals identified in these prehminary experiments was immunosuppressed 
with cyclophosphamide, 100 mg/kg [32], and seven days later the animal was sacrificed and salivary 
gland (SG) homogenates harvested. Presumptive virus in the SG homogenate was passaged in vivo by 
inoculation of two GPCMV seronegative inbred strain 2 guinea pigs with 1 mL SG homogenate by 
subcutaneous route in the dorsal neck. Both of these animals demonstrated DNAemia (3.2 and 
6.9 X 10^ genomes/mL of blood, respectively) at 14 days post-inoculation. Three weeks following 
inoculation, these animals were also immunosuppressed with 100 mg/kg of cyclophosphamide. One 
week later, these animals were sacrificed and SG homogenates were cultured on guinea pig lung 
fibroblast cells (GPLs). Eleven days after inoculation plaques with characteristic cytopathic effect 
(CPE) were observed in one of the SG homogenate cultures (designated Po). Supernatant from this 
flask was used to inoculate GPLs to generate a Pi stock of virus. When CPE was extensive 
(approximately 2 weeks later), an aliquot was removed for electron microscopy (EM) studies 
(described below, Section 2.2) and the remainder of this flask was expanded for large-scale virus 
propagation (20 flasks). These flasks were then incubated for an additional week prior to harvest of P2 
viral stock (Figure lb). Some of these infected flasks were used for DNA purification for lllumina 
MiSeq and Pacific Biosciences PacBio RS sequencing, as described below. The remainder of this 
stock was used for in vivo challenge experiments in guinea pigs, as described in Section 2.4. 

2.2. Morphological Analyses by EM 

To confirm that the isolated virus had morphological characteristics of a cytomegalovirus, EM was 
performed. Analysis of Pi virus stock by conventional microscopy (Section 2.1) revealed 20% of the 
cells showed cytopathic effect. Cells were enlarged with large intranuclear and intracytoplasmic 
basophilic inclusions visible upon examination by light microscopy on thick section of plastic 
preparation stained with Toluidine blue (data not shown). Ultrastructurally, both intranuclear and 
intracytoplasmic inclusions were noted, interpreted as representative of nuclear and cytoplasmic viral 
factories (Figure 2A,B)- Nuclear inclusions were typical for the center of replication and assembly 
characteristic of Herpesvirinae (Figure 2C). These nuclear factories showed a large amount of 
fibrillary electron dense material admixed with moderate to large numbers of empty A-capsids, 
scaffold-containing B-capsids, and DNA-containing C-capsids that are characteristic of herpesvirus 
capsid formation and genome packaging (Figure 2C; most capsids in this image are B-capsids). By 
comparison, a virtually identical appearance was noted in EMs from sfrain 22122-infected fibroblasts 
(Figure 2H). First envelopment was acquired at the inner nuclear membrane. The EM morphology 
supports a model whereby de-envelopment of nucleocapsids takes place at the outer nuclear membrane 
during egress, with the second envelopment and tegumentation occurring in the cytoplasm factory 
(Figure 2D). Rearrangement of Golgi cistems, Golgi vesicles, multivesicular body and endoplasmic 
reticulum in conjunction with formation of nucleocapsids produced large aggregations that were often 
mixed with electron dense material. Numerous dense bodies were present in the intercellular space 
(Figure 2A,F). On negative contrast preparation, a large number of non-infectious enveloped particles 
(220.24 ± 48.15 nm) and dense bodies (357.83 ± 97.85 nm) were observed (Figure 2G). Infectious 
mature virions consisted of an envelope containing a 116.5 ± 2.68 nm in diameter icosahedral capsid 
with capsomeres of approximately 1 1 .66 ± 1.87 nm. The CIDMTR sfrain of GPCMV presented similar 
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cytopathic and pathogenic effects as those demonstrated by ATCC strain 22122 [33-35]. Nevertheless, 
some differences were noted between these two strains. The ATCC strain in contrast to the CIDMTR 
strain, demonstrated more efficient infection of fibroblasts, even at the same multiplicity of infection 
(MOI; MOI of 1), with over 80% of fibroblastic cells exhibiting cytopathic effect. The nuclear and 
cytoplasmic factories were less pronounced in the CIDMTR-infected cells, with generation of reduced 
quantities of infectious and non-infectious enveloped viral particles and dense bodies (data not shown). 

Figure 2. Transmission electron microscopy (EM) microphotographs of fibroblasts 
infected with the CIDMTR Strain of GPCMV. (A) Plastic embedded preparation 
contrasted with uranyl acetate/lead citrate of two enlarged fibroblasts showing cell 
intranuclear and cytoplasmic viral inclusions and numerous dense bodies (white 
arrowheads) in the intercellular space (bar = 10 |j,m). (B) Magnification of another 
CIDMTR-infected cell revealed nuclear (N) viral replication and nucleocapsid assembly 
sites (small square), as well as a large maturation site (asterisk) and dense body formation 
in the cytoplasm (large square, bar = 2 |j,m). (C) Additional magnification in which 
replication and capsid assembly in the nucleus (small square) is appreciated; note empty 
capsids (arrow) and DNA containing nucleocapsids (arrowhead; bar = 0.5 |j,m). 

(D) Magnification of virus maturation sites within the cytoplasm (large square in panel 
(B)) reveals electron dense material formed by aggregation of nucleocapsids and cell 
organelles such as Golgi systems, endoplasmic reticulum and vesicles (bar = 0.5 |j,m). 

(E) Capsid is demonstrated becoming coated with tegument proteins and then acquiring its 
final envelope by budding into vesicles (arrows; bar = 0.5 |j,m). (F) Several dense bodies 
are present in the intercellular space (bar = 0.5 |j,m). (G) Negative contrast preparation of 
enveloped B-capsid (arrow) and dense body (asterisk, bar = 0.5 |j,m) is demonstrated. 
(H) EM of strain 22122-infected cells demonstrating virtually identical morphology; A, B 
and C capsids are identified as described in text. 




Viruses 2014, 6 



454 



Figure 2. Cont. 




2. 3. Sequencing and Sequence Analysis 

Viral DNA (P2) was purified from the CIDMTR strain after two passages in GLFs and subjected to 
sequence analysis using the MiSeq and PacBio platforms as described in Section 3. The CIDMTR 
genome was 232,778 nt in length. Overall sequence homology to strain 22122 (accession 
#KC503762.1) was 98%, consistent with the hypothesis suggested by immunoblotting that CIDMTR is 
a GPCMV and not a novel and distinct betaherpesvirus of guinea pigs. As discussed further in 
Section 4, a possibility existed that genetic diversity of GPCMV isolates could be restricted due to 
bottlenecks in the breeding history of Hartley guinea pigs, from which both 22122 and CIDMTR are 
derived. Historically, HCMV strains were differentiated on the basis of restriction pattern 
polymorphisms. Epidemiologically unrelated isolates, considered unique HCMV strains, exhibit 
predominantly identical restriction patterns with occasional unique fragments. To compare CIDMTR 
to 22122 in these terms, genomic sequences for each virus were used to generate in szYzco-predicted 
restriction patterns for three enzymes {EcoR I, Hind III, and Xba I). As shown in Figure 3a, the 
majority of fragments matched between the two virus genomes in all three predicted restriction 
enzyme patterns, while for each restriction pattern a few unique (unmatched) fragments were evident. 
These results are consistent with CIDMTR and 22122 being distinct GPCMV strains in a manner 
similar to that of HCMV strains. 

Sequence comparison revealed three regions of striking discrepancies between the two genome 
sequences (Figure 3b). The first was located between nucleotides 199227 and 203071 of the 22122 
sequence. Within this 3,845-bp region, 1,437 bps were entirely missing from the CIDMTR genome, 
while the remainder corresponded with a poorly conserved (50% average nucleotide conservation) 
2,366-bp region in CIDMTR. As a result, one of the three gpl38 family ORFs previously annotated in 
22122 was missing from CIDMTR (Figure 3c). Curiously, this region corresponds to a hot-spot for 
spontaneous deletions that occur in response to over length genomes in GPCMV that also roughly 
corresponds to regions of instability/rearrangements in HCMV and rhesus cytomegalovirus [36]. 

The second region of discrepancy was located between nucleotides 221608 and 224743 of the 
CIDMTR sequence. Within this 3,136-bp region, 1,424 bps were entirely missing from the 22122 
genome, and the remainder correspond with a poorly conserved (68% average nucleotide conservation) 
1,684-bp region in 22122. Surprisingly, the additional sequences in CIDMTR encoded a fourth 
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putative MHC class I homolog, annotated gp 147.1, that lies adjacent to three MHC class I homologs 
(gpl47, gpl48, and gpl 49) that were previously identified in 22122 (Figure 3c). 

The third region of discrepancy was located between nucleotides 230176 and 231697 of the 22122 
sequence. Within this 1,522-bp region, 107 bps were missing from the CIDMTR genome, and the 
remainder correspond with a poorly conserved (51% average nucleotide conservation) 1,383-bp region in 
CIDMTR. This discrepant region partially overlaps gpl 49 and results in CIDMTR- and 22129-encoded 
gpl49 proteins that lack amino acid homology in residues 1-134 (22122) or 1-164 (CIDMTR) but are 
highly conserved within the remaining C-terminal 494 residues. 

Finally, like 22122, the CIDMTR genome was found to have direct terminal repeat sequences, 
although the CIDMTR repeats were shghtly shorter (841 nt) than those in 22122 (953 nt). 

Figure 3b highlights the regions of DNA sequence dissimilarity when the CIDMTR strain is 
compared to strain 22122. Figure 3c illustrates in map format the regions of genomic discontinuity. To 
confirm that these large-scale rearrangements did not arise during passage of virus in cell culture, PCR 
and sequence analyses were performed directly on both DNA purified from the original salivary gland 
homogenate (no tissue culture passage), and on DNA from the tissue culture-derived isolate (Po), with 
identical results (see Section 2.5 and Figure 4). The sequences of the PCR-generated fragments were 
identical to the lUumina/PacBio genome sequence, confirming that discontinuities with strain 22122 
did not arise as an artifact of tissue culture passage. It therefore appears that, as it appears in its natural 
host, the CIDMTR strain lacks the gpl 38.1 gene found in 22122. Similarly, 22122 lacks the fourth 
MHC class I homolog gene (gpl 47.1) found in CIDMTR. As the original SG extract fi-om which 
22122 was cultured is no longer available, it is not possible to conduct similar studies to determine 
whether gpl 47.1 was absent from 22122 prior to its initial isolation in cell culture or was lost during 
subsequent in vitro/in vivo passage. Nevertheless, these results suggest that naturally occurring strains 
of GPCMV may be polymorphic with respect to the presence or absence of entire genes. Similar 
findings have been observed when comparing the Smith strain of murine cytomegalovirus to strains 
isolated from wild mice [37]. Further analyses of other primary virus isolates from "naturally infected" 
guinea pigs will be necessary to confirm this hypothesis and to determine if such polymorphisms are 
limited to specific genes, such as gpl 38.1 and perhaps gpl 47.1, or include others. 

Table 1 summarizes the predicted ORFs identified in the CIDMTR sfrain. ORFs that are highly 
conserved in cytomegaloviruses are noted in the table in upper case/bold font (e.g., GP55). In contrast, 
ORFs appearing unique to GPCMV are noted in lower case (e.g., gpl 3 8). The "C" designation 
in Table 1 refers to the complimentary strand. Splicing sites are predicted based on previously 
published reports and these need empiric confirmation in future studies unless otherwise indicated 
(Figures 4 and 5); exons are described as coding exons. 

Amino acid sequences predicted from CIDMTR genes encoding conserved envelope glycoproteins 
were compared to those from sfrain 22122. There was sfriking sequence conservation of the gB 
homolog; although 13 SNPs were noted, only 2 coding changes were observed (overall identity 
of 99%). In contrast, sequence analyses of GP74 (gO) demonsfrated the greatest degree of sequence 
divergence, with 303/374 (81% identity) noted, followed by GP75 (gH), with 613/726 (84%) identity 
compared to the 22122 strain. GP73, the homolog of HCMV glycoprotein N, a protein known to 
exhibit substantial sequence divergence across clinical isolates [19,38,39], demonstrated 92% identity 
between the CIDMTR and 22122 strains. These results confirm that CIDMTR represents a GPCMV 



Viruses 2014, 6 



456 



strain distinct from 22122 as the amino acid sequences of gH, gO, and gN clearly diverged through 
gradual evolutionary processes and cannot be attributed to abrupt, limited events such as insertions, 
deletions, or duplications. This further implies that despite potential bottlenecks imposed by selective 
breeding, GPCMV, as it exists within commercial breeding colonies, exhibits significant strain 
diversity of a nature similar to that of HCMV. These findings suggest that GPCMV may provide a 
useful model to study the impact of naturally occurring strain variation on viral pathogenesis, 
particularly in the area of nonprimary infection and disease. 

Figure 3. Strain-specific genome structure differences between CIDMTR and 22122. 
(a) In silico analysis of predicted restriction endonuclease profiles for CIDMTR strain and 
22122 (ATCC) strain with enzymes EcoRl, Hin&lll, and Xbal. Predictions and profile 
generated using program CLC Main Workbench 6 [40]. (b) Dot-matrix comparison of 
CIDMTR vs. 22122 genomic sequences. Two genome regions demonstrating sequence 
variability resulting in ORFs unique to each strain are indicated by shaded boxes, 
(c) Linear map comparison of the major areas of genome discontinuity. Linear maps 
prepared using XPlasMap [41]. Colored boxes in blue represent predicted conserved ORFs. 
Light red and green colored ORFs demonstrate discordance between the two strains. In the 
CIDMTR strain, an additional ORF not annotated in 22122, 149.1, is also noted (dark red). 
Dashed lines indicate the positions where the sequences differ. See Table 1 for a full list of 
sequenced gene annotations. 
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Figure 3. Cont. 
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Table 1. GPCMV Strain CIDMTR Predicted ORFs. 
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UL46 homolog 


GP47 




74,674 


77,787 


1037 


UL47 homolog 


GP48 




77,784 


84,155 


2123 


UL4o homolog 


GP48.2 


c 


84,238 


84,468 


76 


UL48a nomolog 


GP49 


c 


84,479 


86,119 


546 


T TT /int.™ — « ™ 1 ™ ~ 

UL49 nomolog 


GP50 


c 


86,088 


87,158 


356 


T TT Zf\ 1 „1 

U LdU nomolog 


GP51 


c 


87,281 


87,580 


99 


UL5 1 homolog; terminase subunit 


GP52 




87,900 


89,480 


526 


T TT C 1 1 

UL52 homolog 


GP53 




89,473 


90,459 


328 


T TT C T 1, „ „ 1 ^ 

UL53 homolog 


gp53.1 


c 


90,196 


90,537 


113 


Homology to KhLMV rho6; Yr O65I /9.1 


GP54 


c 


90,551 


93,904 


1117 


UL54 homolog; DNA polymerase 


GP55 


c 


93,947 


96,652 


901 


UL55 homolog; glycoprotein B 


GP56 


c 


96,549 


98,816 


755 


UL56 homolog; terminase subunit 


GP57 


c 


98,967 


102,650 


1227 


UL57 homolog 


gp57.1 


c 


104,411 


104,926 


171 


Homolog to RCMV r23. l"* 


GP69 


c 


108,260 


111,430 


1056 


UL69 homolog 


GP70 


c 


112,136 


115,339 


1067 


UL70 homolog; helicase-primase 


GP71 




115,338 


116,114 


258 


UL71 homolog 


GP72 


c 


116,277 


117,350 


357 


UL72 homolog; dUTPase 


GP73 




117,432 


117,833 


133 


UL73 homolog; glycoprotein N 
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Table 1. Cont. 



GPCMV Open Reading Frames (< 


jKrsj: ciuiviiK strain 


ORF 


Strand 


Begin 


End 


Codons 


Notes 


GP73.5exl 




117,860 


117,869 


3 




GP73.5ex2 




119,005 


119,198 


64 




GP74 


C 


117,780 


118,904 


374 


UL74 homolog; glycoprotein 0 


GP75 


c 


119,349 


121,529 


726 


UL75 homolog; glycoprotein H 


GP76 




121,694 


122,533 


279 


UL76 homolog 


GP77 




122,247 


124,091 


614 


UL77 homolog 


GP78 




124,473 


125,717 


414 


UL78 homolog; 7-1 MR GrCR superiamily 


GP79 


c 


125,914 


126,861 


315 


UL79 homolog 


GP80 




126,722 


129,031 


769 


T TT Of\1 1 /~1TI J1 T J 

UL80 homolog; CMV protease 


GP80.5 




127,610 


129,031 


473 


T TT on C 


GP82 


c 


129,324 


130,889 


521 


T TT 0 1 1 T 1 

UL82 homolog; pp71 


GP83 


c 


131,109 


132,809 


566 


ULo3 homolog; pp65 


GP84 


c 


133,051 


134,487 


478 


T TT OA ^ ^ 1 ^ ,, 

UL54 homolog 


GP85 


c 


134,785 


135,696 


303 


T TT OC t,^™.^1^™ 

UL55 homolog 


GP86 


c 


135,977 


140,026 


1349 


UL80 homolog 


GP87 




140,407 


143,328 


973 


T TT on -^1 ^™ 

ULo / homolog 


GP88 




143,231 


144,499 


422 


UL88 homolog 


GP89ex2 


c 


144,545 


145,675 


376 


UL89 homolog; terminase subunit, exon 2 


GP91 




146,102 


146,365 


87 


ULyi homolog 


GP92 




146,362 


146,991 


209 


T TT m 1 1 

UL92 homolog 


GP93 




146,957 


148,732 


591 


T TT i~\ ^ 1 1 

UL93 homolog 


GP94 




148,644 


149,681 


345 


UL94 homolog 


GP89exl 


c 


150,032 


150,913 


291 


UL89 homolog; terminase subvinit, exon 1 


GP95 




150,967 


152,235 


422 


UL95 homolog 


GP96 




152,468 


152,830 


120 


UL96 homolog 


GP97 




152,910 


154,727 


605 


UL97 homolog; protein kinase 


GP98 




154,747 


156,531 


594 


UL98 homolog; allcaline nuclease 


GP99 




156,444 


156,965 


173 


UL99 homolog; pp28 


gp99.1 




157,155 


157,769 


204 


Homology to KCMV r4 


GPlOO 


c 


157,278 


158,327 


349 


ULIOO homolog; glycoprotein M 


GP102 




158,657 


160,942 


761 


T TT 1 1 1 

UL102 homolog 


GP103 


c 


161,127 


161,852 


241 


UL1U3 homolog 


GP104 


c 


161,815 


163,908 


697 


UL104 homolog; portal 


GP105 




163,748 


166,531 


927 


UL105 homolog; helicase-primase 


GP112exl 




176,745 


177,498 


315 


ULl 12 homolog; replication accessory, ex 1 


GP112ex2 




177,606 


177,782 




ULl 12 homolog; replication accessory, ex 2 


GP112ex3 




178,115 


178,131 




T TT ITT 1 1 1 ■ . ■ 0 

UL122 homolog; replication accessory, ex 3 


GP114 


c 


179,126 


179,920 


264 


ULl 14 homolog; uracil glycosylase 


GP115 


c 


179,986 


180,762 


258 


ULl 15 homolog; glycoprotein L 


GP116 


c 


180,755 


181,654 


299 


Homology to IHV tl 16 ; rc receptor//lg 


GP117 


c 


181,877 


183,262 


461 


ULl 17 homolog 


gpll9.1 


c 


184,418 


185,167 


249 


Similar to MCMV in ACE95619.1 


GP121.2 


c 


185,160 


185,834 


224 


Betaherpesvirus B7D8, accession AFK83957 


GP121.4 


c 


186,299 


187,174 


291 


UL121 homolog; Tupaia tl21.4, NP 116476 


GP122ex3 


c 


187,993 


189,677 


677 


ULl 22 homolog; HCMV IE2 


GP122ex2 


c 


191,079 


191,311 


SGP122exl 


c 


191,403 


191,518 


GP123ex3 


c 


189,907 


190,985 


475 


ULl 23 homolog; HCMV lEl 


GP123ex2 


c 


191,079 


191,311 


^GP123exl 


c 


191,403 


191,518 
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Table 1. Cont. 



GPCMV Open Reading Frames (< 


3RFs): CroMTR Strain 


ORF 


Strand 


Begin 


End 


Codons 


Notes 


GP128 




195,400 


196,455 


351 


bimilar to Bat HbV 13126; Ubzz family 


gpl30 




196,655 


196,999 


114 




GP129ex3 


C 


196,432 


196,690 


178 


Homolog 01 HCMV UL128 


GP129ex2 




196,768 


196,890 


GP129exl 


C 


196,974 


197,128 


GP131ex2 


c 


197,133 


197,469 


191 


T T ™ ™ 1 ™ ~ ™ X* T /r\ r T TT 1 O A 

Homolog 01 HCMV UL130 


GPlSlexl 


c 


197,550 


197,788 


GP133 


c 


197,788 


198,174 


128 


Homolog ot HCMV UL131 


GP134 


c 


198,268 


198,951 


227 




gpl38.2 


c 


199,367 


200,875 


502 




gpl38.3 


c 


201,090 


202,607 


505 




gpl39 


c 


202,706 


204,793 


695 


IHV 15; US22 supertamily 


gpl40 




204,522 


204,929 


135 


Homology to CCMV UL132 


gpl41 


c 


205,053 


206,660 


535 


HCMV Ub23; Ub22 supenamily 


gpl42 


c 


206,928 


208,622 


564 


HCMV US24; US22 superiamily 


gpl43 


c 


AO one 

208,875 


210,866 


663 


IHV 15; US// supertamily 


gpl44 


c 


211,109 


213,403 


764 


US26; US22 gene superfamily 


gpl45 


c 


213,677 


215,575 


632 


HCMV IRSl/TRSl; US22 superfamily 


gpl46 


c 


215,932 


217,914 


660 


HCMV IRSl/TRSl; US22 superfamily 


gpl47.1 


c 


221,739 


222,935 


398 


MHC class I homolog 


gpl47 


c 


223,124 


224,218 


364 


MHC class I homolog 


gpl48 


c 


225,379 


226,560 


393 


MHC class I homolog 


gpl49 


c 


228,236 


230,209 


657 


MHC class I homolog 


gpl49.1 




230,163 


230,465 


100 


Unique ORF sequence in CIDMTR 



An observation of note is the codon, ATA, spanning nucleotides 191516-191518 (minus strand). This 



codon is ATG in strain 22122, which was annotated as the start codon for both IE2 (GP122; exons 3, 4 and 5) 
and lEl (GP123; exons 3, 4 and 6) [42^4]. Whether alternative sphcing is occurring in the CIDMTR strain, 
or the start codon for the IE1/IE2 proteins is different between the two strains of GPCMV (i.e., in exon 4, not 
exon 3), requires further evaluation; this is described in greater detail in Section 2.4. 

2.4. Sequence Characterization ofDNA from Original SG Homogenate in Region of IE 1/2 Start Codon 

A surprising finding from the DNA sequence analysis was the finding of an ATA codon as the 
putative start codon of the IE1/IE2 protein product [42]. To examine this issue, sequence analyses of 
PCR-ampUfied DNA from the original salivary gland homogenates, as well as salivary gland 
homogenates from further in vivo passages, were undertaken. DNA from the original salivary gland 
homogenate that was the source of the CIDMTR isolate was compared to DNA from GPL cells 
infected with CIDMTR following cell culture passage. The PGR was done with the primer pair 
designated IE splice 3' PI (5'-TGGGAAGGGATGTGTGTGAAG-3') and IE splice 5' PI (5'- 
GTGGTTGTAGGTGTGGTGGTGA-3'), which was predicted to produce an 864-bp product from 
GIDMTR DNA. The purified DNA was cloned and three clones from each reaction were sequenced. 
This analysis indicated that the original salivary gland source of the CIDMTR strain contained viral 
DNA that encoded an ATG codon (as does 22122), the putative start codon for the IE1/IE2 protein 
product, but that the tissue cultured-derived CIDMTR strain, following two passages in fibroblasts, 
had a DNA sequence corresponding to an ATA codon. 
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To further examine splicing of the IE region from the CIDMTR strain (tissue culture-derived; 
CIDMTR-TC) virus, reverse-transcriptase PCR was performed on RNA harvested at immediate early 
times post- infection. RNA was extracted from GPL cells infected with 22122 or CIDTMR at 4 hours 
post-inoculation cDNA was synthesized from 1 |ag of total RNA. Conventional PCR was carried out 
using cDNA as template. PCR used primer pair IE exon4 Fl (5'-CCGCATTTTCTGAGGGTGTT-3') 
and IE exonl Rl (5'-CATGCCAGTTCCCTGTGCTG-3'), and primer pair IE exonS Fl (5'- 
GGAAGATGTCCACTTGGAAG-3') and IE exon2 Rl (5'- ACGTAGCCGAGAAGTAAAGT-3'). 
The expected product sizes for primer pair IE exon4 Fl/ IE exonl Rl and IE exon5 Fl/IE exon2 Rl 
were 473 bp and 508 bp, respectively. 

A product of the expected size was purified, cloned into pCR2.1, and multiple clones with inserts 
from each reaction were sequenced. Notably, all CIDMTR-derived clones contained ATA at the 
putative (collinear) IE1/IE2 gene product start codon, confirming the DNA sequencing analysis. In 
contrast, all the 22122 clones contained an ATG at this position. The observed exon junctions were as 
described in the 22122 strain. The ATG codon in exon 3 was originally annotated as the putative IE 1/2 
start codon in 22122 [41] and the DNA sequence analysis of DNA from the CIDMTR strain purified 
directly from the salivary gland homogenate also demonstrated an ATG codon in exon 3. Thus, the 
finding of this ATA by DNA and RT-PCR sequencing in tissue culture-passaged CIDMTR virus was 
surprising. Since IE 1/2 is abundantly expressed and ATA is known as an inefficient initiator of protein 
translation, we looked for ATGs that might serve as alternative start sites. Tissue culture -passaged 
CIDMTR strain may employ a start codon in exon four for the IE 1 and IE2 gene products (Figure 4), 
although the Kozak consensus sequence of this putative start codon (T at -3, A at +4) is relatively 
weak [45]. 

Figure 4. RT-PCR mapping of CIDMTR strain and 22122 sfrain splice sites, (a) Cartoon 
representation of the IE 1/2 gene locus illustrating positions of primer pairs used for 
RT-PCR. Introns are indicated by straight lines and exons are blue boxes, although exon 1 
and 2 are non-coding, (b) Results of RT-PCR reactions e4-l, e5-2, and GAPDH (control) 
using RNA from uninfected cells or cells infected with 22122 or CIDMTR. (c) RT-PCR 
consensus sequence of CIDMTR strain. Exon junctions are highlighted in blue. The first 
gray highlighted sequence is an ATA codon; this is an ATG codon and the putative start 
codon for the 22122 IE 1/2 proteins, but is not conserved in the CIDMTR sequence in tissue 
culture passaged virus. The second gray highlighted sequence may therefore represent the 
putative start codon for CIDMTR TC IE 1/2. 

(a) 

Exon 6 Exon 5 Exon 4 Exon 3 Exon 2 Exon 1 

22122 IE gene 







— n- 









T t t 

stop Stop Start 

codon codon 



Primer pair e4-l 



Primer pair e5-2 
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(b) 



control 



Figure 4. Cont. 

22122 CIDMTR 



-RT +RT -RT +RT -RT +RT 



500bp 



500bp 



200bp 




Primer Pair e5-2 



Primer Pair e4-l 



GAPDH 



(C) 



CTTCCAAGTGGACATCTTCCflAGGACGCCGCCATCTCGGGAGGACCAGCCA^GTTGAAGCTTCGGTTTTAAACCCGTCGGGCGCTCGGCTTTTACTG 
TTTAAGACCGCTTTTTCAACTTTACTTCTCGGCTACGTGCCTGGACAGTAAACCTTGCCAGACGCCCGGTTCTTCTGCCCCCCCCCCCCATCAATTCA 
A 'el«Mll>il GAA ACA AAC AGA GAG TCC ACT TCC CCG TCC AGA AAC TCC CCC GTA TCA GAA TCA TCC ACT GGG GCT GCC 

lETNRESTSPSRNSPVSESSTGAA 
GCC ACA AGT GAT GCC GCA TCA AGC TCT CAG AAT AAT GAC ACC C lSiS TCA AAT GCC ACA CCG TCC CTC CTC CGA GAC 

ATSDAASSSQNMDTRSNATPSLLRD 
GAA TAC GAG AGC GTC TCC ACA TTC AAA GAT GTA TTG GAG AGA GTC AAC AAC ACG TTC GAC TTA GAT CAG TTC 

EYESVSTFMKDVLERVNNTFDLDQF 
GTA ACT AAC CCA GAC ATC GAT CTC AAT TTT CAA CCA ACA CCC TCA GAA AAT GCG GAG ATA GAA AGT TTT ACC AGT 

VTNPDIDLNFQPTPSENAEIESFTS 
GGA TCC GTC TCC AGG GGC GAT ATG TCA AAC GAT AGA CTG AGC GCT GTT GCA SHt CGC GCC CTG GTA CTC GTC ATG 



V 



R 



D 



M 



N 



D 



R 



V 



V 



R 



V 



V 



M 



AAA CGA AAC ATT CAG GAG TTA CAA GAT AGT CTT AGG CAG CAC AGG GAA CTG GCA TG 
KRNIQELQDSLRQHRELA 



To further define IE transcription from the CIDMTR-TC virus, 5' RACE was performed. RNA was 
extracted and cDNA was synthesized with a specific sequence attached to the 5' UTR for capped 
mRNAs. After ligation of a specific RNA oligo to the 5'UTR of mRNAs, cDNA was synthesized using 
oligo-dT and random hexamers. The cDNA was used as template for a first round of PCR using 
GeneRacer 5' Primer and primer IE2 P5 (5'-GGCGTCAATGGGCTCGGGTTTGAT-3'). Two nested 
PCR reactions were then performed with the GeneRacer 5' Nested Primer, per the manufacturer's 
specifications, and IE2 P5 or IE exon3 PI (5'-GGCAGCCCCAGTGGATGATTCTGATA-3') 
(Figure 5b). Purified DNA was cloned and sequenced. Results matched the previously described splice 
sites for strain 22122 [42]. Using this primer combination, the RT-PCR product was 734 nt in size 
(5 nt longer than the homologous ATCC/22122 cDNA). The substitution of ATA for the putative 
ATCC/22122 IE2/3 start codon was once again noted. We did not observe other in-frame start codons 
for the CIDMTR strain IE 1/2 other than the one identified in exon 4. Further experiments will be 
required to identify the start codon for IE 1/2; possibly, both 22122 and CIDMTR strains use the start 
codon in exon 4, in spite of its suboptimal Kozak consensus. Alternatively, growth of the CIDMTR 
strain in fibroblasts may select for mutation of the exon 3 start codon, even upon minimal passage. 
Development of guinea pig epithelial/endothelial cell for isolation and passage of CIDMTR virus is an 
important ftiture priority, to determine whether in vivo and in vitro passage result in different selective 
pressure on viral sequences. 
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Figure 5. 5' RACE analysis of CIDMTR IE region, (a) Schematic of IE gene indicating 
primers pair used in RACE, (b) nPCR gels for 22122 and CIDMTR using RNA purified 
under IE conditions. The CIDMTR RACE product is 734 nt compared to 729 nt for 
the 22122 strain, explaining its slightly higher migration by gel electrophoresis, (c) RACE 
consensus sequence of CIDMTR IE. Exon junctions highlighted in blue. First gray 
highlighted sequence is co-linear to the start codon for 22122; second highlighted sequence 
represents putative start codon for CIDMTR IE 1/2. 



(a) 



Exon 6 



Exon 5 



Exon 4 



Exon 3 Exon 2 

t 



Exon 1 



RNA oligo 



Stop 
codon 



Stop 
codon 



Start 
codon 



-5' Primer 



Gene Specific Primers (22122 and CIDIVITR) 



5' Nested Primer 



(b) 



22122 



CIDMTR 



800 bp 



500 bp 




nPCR 
IE2P5 



nPCR IE 
exon3 PI 



(C) 



AGAGCGAGGAGGCTCTGCAAGAGAGCACATCGTCCAGGAGCGACACCCCGCGGCTGCCCAGAGGACTGCACCTTGGACCACGCCTGCTTCGGAGCGCC 
GCCATTTTCTTCACCAGGTGGCCACGCTTCCAAGTGGACATCTTCCAAGTGGACATCTTCCAAGGACGCCGCCATCTCGGGAGGACCAGCCA^GTTG 
AAGCTTCGGTTTTAAACCCGTCGGGCGCTCGGCTTTTACTGTTTAAGACCGCTTTTTCAACTTTACTTCTCGGCTACGTGCCTGGACAGTAAACCTTG 
CCAGACGCCCGGTTCTTCTGCCCCCCCCCCCCATCAATTCAA^^^ GAA ACA AAC AGA GAG TCC ACT TCC CCG TCC AGA AAC TCC 

lETNRESTSPSRKS 
CCC GTA TCA GAA TCA TCC ACT GGG GCT GCC GCC ACA AGT GAT GCC GCA TCA AGC TCT CAG AAT AAT GAC ACC C^ 

PVSESSTGAAATSDAASSSQKNDTR 
TCA AAT GCC ACA CCG TCC CTC CTC CGA GAC GAA TAG GAG AGC GTC TCC ACA TTC ^ AAA GAT GTA TTG GAG AGA 

SNATPSLLRDEYESVSTFMKDVLER 
GTC AAC AAC ACG TTC GAC TTA GAT CAG TTC GTA ACT AAC CCA GAC ATC GAT CTC AAT TTT CAA CCA ACA CCC TCA 

VNNTFDLDQFVTNPDIDLNFQPTPS 
GAA AAT GCG GAG ATA GAA AGT TTT ACC AGT GGA TCC GTC TCC AGG GGC GAT ATG TCA AAC GAT AGA CTG AGC GCT 

ENAEIESFTSGSVSRGDMSNDRLSA 
GTT GCA ^C ACG CCC GAG GGA TGG CTG ACA GCT TCC ATC TCA CAG ATC AAA CCC GAG CCC ATT GAC GCC 

VADTPEGWLTPSISQIKPEPIDA 



2.5. PCR Confirmation of GPCMV-CIDMTR Genome Structure 

To confirm the structure of the CIDMTR strain compared to the 22122 strain, PCR was performed 
on viral DNA from both strains, using primers spanning the mismatched regions observed in the 
sequence analysis comparisons (Figure 3b, boxed/shaded regions). Two primers pairs were used for 
each region of mismatch (mismatch region 1 and mismatch region 2). PCR was performed using 
primer pairs mismatch-region 1 Fl/Rl and mismatch-region 1 F2/R2, that amplify a ~4-kb region for 
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the 22122 strain, but a ~2.5-kb region for the CIDMTR strain. The amplification region using primer 
pairs mismatch-region 2 Fl/Rl or mismatch-region 2 F2/R2 was predicted to be ~2.2 kb for the 22122 
strain and -3.7 kb for the CIDMTR strain. Primer sequences are indicated in Table 2. The results of 
these experiments confirmed that the genome configuration was precisely as predicted from the deep 
sequence analysis. Moreover, the PCR was also performed on DNA purified directly from the original 
salivary gland homogenate from which the CIDMTR isolate was obtained (data not shown). These 
results confirmed that the insertions and deletions identified by sequencing (Figure 3 c) did not arise as 
an artifact of limited passage of virus in cell culture. The PCR results are shown in Figure 6. 



Table 2. Primer Sequences. 



Primer Name 


Primer Sequence 


Mismatch Region 1 Fl 


5'-GTGAGACGTA AGAATAGCTTGC-3 ' 


Mismatch Region 1 F2 


5'-GATCCTTAGACTCTATCACGG-3' 


Mismatch Region 1 Rl 


5'-GTGTTGTCACAATTGGCACATG-3' 


Mismatch Region 1 R2 


5'-ACATGGTCACGACAGAATC-3' 


Mismatch Region 2 Fl 


5'-GTGGACAGGATCCCCAAATT-3' 


Mismatch Region 2 F2 


5'-CCAAATTTCTGTCGTCGGCG-3' 


Mismatch Region 2 Rl 


5'-TGTTTCCGTGTCTGTCTCCGT-3' 


Mismatch Region 2 R2 


5 '-GTCTTAGCCCGAGACCTTC-3' 



Figure 6. Strain-specific PCR assays differentiate ATCC 22122 strain of GPCMV from 
CIDMTR strain, (a) Primers were designed to amplify across discontinuous/unique regions 
as demonstrated in Figure 3b. Lane 1, kb ladder; lane 2, mismatch region 1 Fl/Rl, ATCC; 
lane 3, mismatch region 1, Fl/Rl, CIDMTR; lane 4, mismatch region 1, no template 
control; lane 5, mismatch region 1 F2/R2, ATCC; lane 6, mismatch region 1 F2/R2, 
CIDMTR; lane 7, mismatch region 1 F2/R2, no template control; lane 8, mismatch 
region 2 Fl/Rl, ATCC; lane 9, mismatch region 2 F1/R2, CIDMTR; lane 10, mismatch 
region 2 Fl/Rl, no template control; lane 11, mismatch region 2 F2/R2, ATCC; lane 12, 
mismatch region 2 F2/R2, CIDMTR; lane 13, mismatch region 2 F2/R2, no template 
control, (b) Restriction polymorphisms were as predicted by DNA sequence. PCR 
amplification products were digested with enzymes as indicated. Lane 1, kb ladder; lane 2, 
mismatch region 1 Fl/Rl ATCC, Bgl 2; lane 3, mismatch region 1 Fl/Rl CIDMTR, Bgl 2; 
lane 4, mismatch region 1 F2/R2 ATCC, Bgl 2; lane 5, mismatch region 1 F2/R2 
CIDMTR, Bgl 2; lane 6, mismatch region 1 Fl/Rl ATCC, BamH I; lane 7, mismatch 
region 1 Fl/Rl CIDMTR, BamH I; lane 8, mismatch region 1 F2/R2 ATCC, BamH I; 
lane 9, mismatch region 1 F2/R2, BamH I; lane 10, mismatch region 1 FlRl ATCC, EcoR 
V; lane 11, mismatch region 1 Fl/Rl CIDMTR, EcoR V; lane 12, mismatch region 1 
F2/R2 ATCC, EcoR V; lane 13, mismatch region 1 F2/R2, EcoR V; lane 14, mismatch 
region 2 FlRl ATCC, EcoR I; lane 15, mismatch region 2 FlRl CIDMTR, EcoR I; 
lane 16, mismatch region 2 F2/R2 ATCC, EcoR I; lane 17, mismatch region 2 F2/R2, 
EcoR I. 
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2.(5. Infection In Vivo and Development of a Real-Time PCR Assay for Detection of CIDMTR DNA 

To evaluate DNAemia and end-organ infection in the course of in vivo studies, a strain-specific 
real-time PCR assay was developed in order to differentiate the GPCMV-CIDMTR strain from 
the 22122 strain. This assay focused on the amplification of sequences corresponding to the CIDMTR 
strain GP147.1 ORF (Table 1), since this sequence is absent in the 22122 strain. A GPCMV 147.1 
specific PCR primer pair, consisting of CIDMTR147.1_464F (5'-ATGCAACATAGCGTGCTGAC-3') 
and CIDMTR147.1_583R (5'-GGGACAAAAGCACGATGAAC-3') was designed and utilized for the 
real-time PCR assay (described in detail in 3.5). These primers amplified a 120 bp region of the 147.1 
gene specific for the CIDMTR strain. DNA was extracted from either 100 \\L citrated blood or from 
fresh frozen tissues samples, as described in the Methods section. For quantitative PCR, both 
previously validated primers for the GP83 gene [sequences shared by both strains] and novel primers 
for the GPCMV 147.1 gene [sequences only found in the CIDMTR strain] were used for real-time 
PCR assay. There was strong concordance between the viral load estimates identified by the gp 147.1 
and GP83 real-time primers (data not shown). As a negative control, GPCMV 147.1 primers were used 
in several PCR assays of 22122 DNA, with consistently negative results (data not shown). 

Next, twelve young, GPCMV-seronegative outbred Hartley guinea pigs were divided into two 
groups of six/group. Each group was inoculated with CIDMTR strain virus (pi) at a dose of 1 x 10^ pfU, 
administered subcutaneously, as described in Section 3. Group 1 (n = 6) was treated with 200 mg/kg 
cyclophosphamide on day -1 and 50 mg/kg on day +6 following viral inoculation as described 
previously [32]; group 2 (n = 6) was sham-treated (PBS only). Whole blood and sera samples were 
collected on day 0, 3, 7, and 21. Animals were sacrificed on day 21 and tissue, including lung, liver, 
spleen, and brain, were collected for PCR analysis. In group 1, 5/6 animals were DNAemic, peaking at 
day 7 (mean, 3.2 +/- 0.3 logio genomes/mL) while in group 2, the prevalence of DNAemia was lower 
(3/6 animals; 2.5 +/- 0.35 logio genomes/mL, p = 0.06 compared to group 1). Viral DNA was most 
readily recovered from spleen upon dissection at day 21 post-infection. All 6 animals from each 
group had recoverable CIDMTR strain DNA in the spleen. Total spleen viral load in group 1 
was 2.4 +/- 0.07 logic copies/mg and was 2.4 +/- 0. 1 logio copies/mg in group 2 (p = NS compared to 
group 1). 
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The ability of the CIDMTR strain to infect pregnant animals previously inoculated with the 22122 
strain was next assessed. A total of 6 female seronegative Hartley guinea pigs received a primary 
infection with SG homogenate GPCMV (22122) at a dose of 1 x lO'* pfu, administered subcutaneously. 
A parallel group of animals were sham- treated (PBS only). Then, 5 weeks after inoculation, the 
animals in both groups were mated with GPCMV-seronegative breeders. Dams originally inoculated 
with the SG virus (22122 strain) were documented to undergo seroconversion; in addition, all were 
demonstrated to have low-level DNAemia after inoculation with 22122 virus (data not shown). 
Approximately 5 weeks after the initiation of mating, group 1 was challenged subcutaneously with 
CIDMTR strain at a titer of 6 x 10^ pfii. A total of 19 pups were bom in the experimental group; all 
pups were livebom. When pup tissues were analyzed, 4/19 (21%) of these animals had recoverable 
CIDMTR strain-specific DNA isolated fi-om the salivary gland. One of these animals also had 
detectable DNA in the brain. 

3. Experimental Section 

3.1. Cells, Virus and DNA Preparation 

GPCMV (strain 22122, ATCC VR682), and CPCMV/CIDMTR were propagated on guinea pig 
lung fibroblast cells (GPL; ATCC CCL 158) in F-12 medium supplemented with 10% fetal calf serum 
(FCS; Gibco-BRl), 10,000 lU of penicillin/liter, 10 mg of streptomycin/liter (Gibco-BRL), and 7.5% 
NaHC03 (Gibco-BRL). When the Pi passaged CIDMTR virus exhibited extensive CPE in GPL cell 
culture, cells and virions were pelleted. Aliquots were fixed in phosphotungstic acid for EM studies 
(see below) and the remainder washed in 10 mM Tris (pH 8.0)/l mM EDTA (TE), and lysis buffer 
(200 mM NaCl, 2% SDS, 200 ng/mL proteinase K, in TE), was added. Cells and virions were gently 
suspended, inverted, and incubated at 68 °C ovemight. Following transfer to a 37 °C water bath, three 
successive phenol-chloroform extractions were performed, followed by ethanol precipitation. 

3.2. Transmission Electron Microscopy 

Cells were fixed in 1 mL of 2.5% glutaraldehyde in 0.1 M sodium cacodylate buffer and post fixed 
with 1% osmium tetroxide in 0.1 M sodium cacodylate buffer (all reagents from Electron Microscopy 
Sciences, Hatfield, PA, USA). After three washes in distilled water, samples were dehydrated using a 
25%-100% ethyl alcohol gradient. Samples were then infiltrated with 2:1 ethanol: Embed 812 resin 
(Electron Microscopy Sciences, Hatfield, PA, USA) for 1 hour and subsequently transferred to a 1 :2 
ethanol: Embed 812 resin mixture for 1 hour. Cells were ftirther infiltrated with 100% resin and were 
embedded and incubated at 58 °C for 24 hours to polymerize the resin. Embedded samples were 
trimmed and sectioned on a Leica UC6 Ultramicrotome (Leica Microsystems, Vienna, Ausfria). Thin 
sections (60-70 nm) were obtained and collected on a 200 mesh copper grid (Electron Microscopy 
Sciences, Hatfield, PA, USA) using a perfect loop. Grids were contrasted with 5% uranyl acetate 
for 20 minutes and Santos' lead citrate for 6 minutes. For negative contrast, virions and dense bodies 
were collected from Pi supematants of infected fibroblasts, transferred to airfiige tubes (Beckman-Coulter, 
Brea, CA, USA), and centriftiged at 30 PSI using an airfuge (Beckman-Coulter, Brea, CA, USA) 
for 20 minutes on parafihn and formvar coated copper grids (Electron Microscopy Sciences, Hatfield, 
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PA, USA). Excess liquid was wicked and the grids were stained with 1% phosphotungstic acid for one 
minute. All sections were observed under JEOL 1200 EX II transmission electron microscope (JEOL 
LTD, Tokyo, Japan). Images were obtained using a Veleta 2K x 2K camera with ITEM software 
(Olympus SIS, Munster, Germany) [46]. 

3.3. Deep Sequencing and Sequence Analyses 

For sequence analysis of CIDMTR viral DNA, virions were purified as described previously [44], 
and lysis buffer (200 mM NaCl, 2% SDS, and 200 |ag/mL proteinase K in Tris-EDTA [TE]) was 
added. Following incubation at 68 °C overnight, three phenol-chloroform extractions were performed, 
followed by ethanol precipitation of viral DNA. Genomic sequencing was performed using lllumina 
MiSeq and Pacific Biosciences PacBio RS platforms. Approximately 5.2 million 151-bp paired-end 
MiSeq reads were generated at the University of Minnesota's Biomedical Genomics Center with a 
nominal insert size of 400 bp. Removal of low quality reads and PhiX sequence resulted in a set 
of 4.0 million cleaned reads, approximately ll,000x coverage. Initial scaffolds were generated from 
the cleaned lllumina reads using the ABySS assembler (version 1.3.4) [47]. Scaffold quality was 
assessed manually by comparison with the reference strain, 22122 [44,48,49], and by remapping the 
reads using Bowtie 2 and scrutinizing local coverage and consensus using Tablet [50] and 
SAMtools [51]. Special attention was paid to correct alignment and orientation of the paired ends. 
Regions of weak coverage and scaffold gaps were identified and closed either by manual local 
assembly [52] or by Sanger sequencing. Independent validation of the pseudomolecule was also 
performed using the longer PacBio RS reads, validating the Sanger sequencing and manual assembly. 
SMRT Analysis software [53] produced 998 high quality ("corrected") reads ranging between 
509-15,898 bp, median 6,257 bp (approximately 27x coverage), as well as another set of scaffolds. 
These data were used to evaluate the lUxmiina assembly, specifically its structural correctness, and to 
correct misassembled repeat regions. The resulting complete genome was deposited with the EMBL 
Nucleotide Sequence Database (accession number HG531783). 

3.4. RT-PCR and RACE Analyses 

Reverse-transcriptase PGR was performed on RNA harvested at immediate early times post-infection. 
RNA was extracted from GPL cells infected with 22122 or CIDTMR, at 4 hours post-inoculation, 
using the RNeasy mini kit (Qiagen, Hilden, North Rhine-Westphalia, Germany) according to the 
manufacturer's instructions. RNA was treated with RNase-free DNase Set (Qiagen) while in the 
column according to manufacturer's instructions. cDNA was synthesized from 1 |ag of total RNA 
using Quantitect Reverse Transcription kit (Qiagen). Conventional PCR was carried out using cDNA 
as template and AmpliTaq Gold Fast PCR Master Mix (Invitrogen, Carlsbad, CA, USA). Primers are 
as described in Section 2.4. The PCR products were run in a 0.7% agarose gel. A band of the expected 
size was cut form the gel and purified using QIAquick Gel Extraction Kit (Qiagen). The purified DNA 
was cloned into pCR2.1 (Invitrogen) using the TA Cloning Kit (Invitrogen). Three of clones with 
inserts from each reaction were sequenced with T3 promoter. Ml 3 Reverse and the PCR primers. 

For 5' RACE analysis, RNA was used as template for cDNA synthesis using the GeneRacer kit 
(Invitrogen) in order to generate a cDNA with a specific sequence attached to the 5' UTR for only 
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mRNAs that were capped. After ligation of a specific RNA oligo to the 5'UTR of mRNAs, 
cDNA was synthesized using a combination of oUgo-dT and random hexamers. The cDNA was used 
as template for a first round of PCR using GeneRacer 5' Primer and primer IE2 P5 (5'- 
GGCGTCAATGGGCTCGGGTTTGAT-3'). In order to increase the specificity and quantity of the 
PCR product, nested PCR (nPCR) was performed in a dilution (1:20) of the PCR product. Two nPCR 
reactions were performed with the GeneRacer 5' Nested Primer, per the manufacturer's specifications, 
and IE2 P5 or IE exonS PI (5'- GGCAGCCCCAGTGGATGATTCTGATA-3'). A band of the 
expected size was cut form the gel and purified using QIAquick Gel Extraction Kit (Qiagen). The 
purified DNA was cloned into pCR4-TOPO (Invitrogen) using the TOPO TA Cloning. Clones were 
analyzed for the present of the insert by digestion with EcoR I. Three of the clones with insert fi-om 
each reaction were sequenced with T3 promoter. Ml 3 forward (-20) and the PCR primers. 

3.5. GPCMV-CIDMTR Strain PCR Assay 

For confirmation of viral genome structural differences noted by deep sequencing, PCR was 
performed for each virus using primers indicated in Table 2. The PCR reaction was performed in 
a 50 i^L of total volume using GoTaq long PCR Master Mix from Promega and 1.0 \iM primers. The 
DNA template was total genomic and viral DNA extracted from GPL cells infected with 22122 or 
CIDMTR tissue culture strain. The conditions for the PCR were: initial denaturation at 95 °C 
for 2 min, followed by 95 °C for 30 s, 53 °C for 30 s, 72 °C for 4 min for a total of 35 cycles, and 
elongation at 72 °C for 10 min. The PCR product (4 |xL) was subjected to elecfrophoresis in a 0.7% 
agarose gel. The PCR product from mismatch 1 F2/R2 and mismatch 2 F2/R2 were purified from the 
gel using the Geneclean® II kit (MP Biomedicals, Santa Ana, CA, USA) following the manufacturer's 
instructions. The cleaned PCR product was then sequenced by Sanger sequencing (Functional 
Biosciences, Madison, WI, USA). Purified PCR products were also subjected to restriction 
endonuclease comparisons as indicated in Figure 5b. 

The real-time PCR assay focused on the amplification of sequences corresponding to 
the CIDMTR strain gpl47.1 ORF (Table 1), since this sequence is absent in the 22122 
strain sequence. A GPCMV gpl47.1 specific real-time PCR primer pair, consisting of 
CIDMTR147.1_464F (5'-ATGCAACATAGCGTGCTGAC-3') and CIDMTR147.1_583R (5'- 
GGGACAAAAGCACGATGAAC-3') was designed and utihzed for the real-time PCR assay. 
These primers amplified a 120 bp region of the gp 147.1 gene specific for the CIDMTR strain. 
The specific hydrolysis probe used for detection was CIDMTR147.1_494P (FAM- 
GTGTTCGTGTCCTTGATCGTACGCA-BHQl). A second GPCMV 147.1 specific primer pair, 
CIDMTR147.1_225F (5'-AATGGTTCGCTACGGACATC-3') and CIDMTR 1 47. 1_368R (5'- 
CGGACAACGGAACATACTTG-3') was also utilized in real-time PCR assays. These 
primers amphfied a 144 bp region of 147.1 specific for the CIDMTR strain. The specific 
hydrolysis probe used for detection with this primer pair was CIDMTR147.1_262P 
(FAM-TTCCTCGACGAAGCTCGCGGTATAAT-BHQl). In each instance, the PCR reactions were 
performed in a 25 |xL reaction, using LightCycler 480 Probes Master from Roche (Penzberg, Bavaria, 
Germany); as well as 0.4 |xM primers, 0.1 |xM probe and 0.4 u/|xL of UNG. PCR was performed using 
the LightCycler 480 Real-Time PCR System (Roche) under the following conditions: initial 
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denaturation at 95 °C for 10 min, followed by 95 °C for 10 s, 56 °C for 15 s, 72 °C for 10 s for a total 
of 45 cycles, then a final hold step at 40 °C. The first primer pair (464F and 583R) was chosen for 
detection of viral genome for in vivo studies. Data were analyzed with the LightCycler Data Analysis 
Software (version 1.5; Roche) [54] using standard curves generated using serial dilutions of plasmid 
pCR2.1 with gpl47.1 at known concentrations. Negative results were arbitrarily assigned a level 
of 50 for the purpose of statistical comparisons, based upon limit-of-detection analyses observed in 
other real-time PCR experiments [55]. 

3. 6. Animal Challenge Studies 

All animal studies were performed with the approval of the University of Minnesota Institutional 
Animal Care and Use Committee (lACUC). Some animals were immune suppressed at day 
-1 (200 mg/kg) and day +6 (50 mg/kg) with cyclophosphamide delivered by intraperitoneal injection. 
For experiments described in Section 2.6, both cyclophosphamide-treated and untreated animals 
(n = 6/group) were challenged with CIDMTR strain virus (Pi workpool) at a dose of 1 x 10^ pfia by 
subcutaneous injection. Blood samples were collected at day 0, 3, 7 and 21 post-inoculation and 
animals humanely sacrificed at day 21 for collection of tissue samples for PCR analyses. 
Pregnancy/challenge studies were conducted as described in Section 2.6. Livebom pups were 
sacrificed within 72 hours of delivery for DNA extraction and subsequent PCR. 

4. Conclusions 

The 22122 strain was originally isolated by Hartley in 1957 [27]. As the only characterized isolate 
of GPCMV it was used in virtually all subsequent GPCMV research. However, GPCMV infection is 
common among animals in commercial breeding colonies. In a longitudinal study conducted by 
Hsiung and colleagues from 1974 to 1979, GPCMV-neutralizing antibody was observed in 25% of 
Hartley strain guinea pigs obtained from commercial sources while virus was isolated from only 6 of 
204 animals [56]. The percentages of antibody-positive animals obtained from different sources varied 
from shipment to shipment, ranging from 8%-50%. It does not appear that any of these viral isolates 
were retained. Thus, given only a single characterized isolate, the extent to which diverse GPCMV 
strains have been maintained within these domesticated populations was not previously known. Given 
that Hartley guinea pigs are descended from animals imported from South America to Europe in 
the 16th century, and that these animals subsequently underwent centuries of selective breeding, first 
as pets and later as research animals, the possibility existed that bottlenecks in the breeding history of 
Hartley guinea pigs could have limited the genetic diversity of GPCMV strains currently endemic in 
commercial breeding facilities. 

In the current study, a second virus was independently isolated and characterized, again from a 
Hartley sfrain guinea pig obtained from a commercial supplier in the United States. Based on a high 
degree of genetic relatedness to 22122, predominantly similar restriction endonuclease patterns, and an 
conserved overall genomic structure, we conclude that CIDMTR is a GPCMV and not a novel and 
distinct betaherpesvirus. However, based on restriction pattern polymorphisms and significant 
divergence of amino acid sequences for several envelope glycoproteins (particularly gH and gO), we 
conclude that CIDMTR and 22122 represent two distinct strains of GPCMV and are not minor variants 
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of the same strain. This further suggests that GPCMV strain diversity has been sustained within 
commercial breeding colonies and that other strains exhibiting similar levels of divergence may exist 
and could potentially be exploited to further extend this important animal model. 

The passage history of 22122 is uncertain. In the original report by Hartley, two parallel isolations 
of GPCMV were reported: one, from salivary gland homogenate obtained from guinea pigs purchased 
by a commercial supplier in Yonkers, NY, and a second isolate obtained from a second supplier in 
New York state. Reactivation of virus may have been driven by allogeneic responses engendered from 
injecting SG extract into an animal with a different allotype, as has been suggested may drive HCMV 
and MCMV reactivation during transplantation [57,58]. Allogeneic responses may similarly have 
played a role, along with cyclophosphamide immunosuppression, in reactivation of the CIDMTR 
strain, since repeated tissue culture of salivary gland explants directly obtained from seropositive 
animals failed to result in growth of virus (data not shown). With respect to the original isolation 
of 22122, it is worth noting that serial in vivo passages may have occurred in the context, for some 
animals, of "mixed infection", since Hartley noted that 7.7% (3/39) of "control NIH" strain guinea 
pigs aged 5 months or older were naturally infected, as evidenced by infrequent salivary gland 
inclusions [27]. This strain underwent 22 additional passages in cell culture of fibroblasts, and appears 
to be the strain that eventually was deposited with ATCC. The precise date of deposit is unclear, but 
reports of GPCMV research from the 1960s [59] and early 1970s describe obtaining this strain directly 
from the NIH, while studies since the late 1970s describe obtaining the virus from ATCC [60]. 
Between its original isolation by Hartley and its submission to ATCC, it appears to have undergone 54 
additional passages in guinea pigs, and six additional passages in cell culture (3 passages in guinea pig 
embryo fibroblasts, and 3 passages in CCL 158 cells [61]. The 22122 strain derives from multiple 
rounds of both in vivo passage, some possibly occurring in the context of mixed infection, and 25-30 
passages in cell culture. Thus, it is possible that 22122 underwent changes after isolation from the 
initial animal, either in cell culture or during in vivo passage. 

In contrast, CIDMTR was subjected to minimal passage (one passage in vivo and two passages in 
cell culture) prior to genomic characterization. Thus, the genomic structure and sequence of the 
CIDMTR sfrain may more likely represent a bona fide "wild-type" GPCMV sequence than does the 
22122 sfrain. 

Sequence comparison with the 22122 strain revealed generally good conservation of protein coding 
sequences, although three areas of substantial discontinuity were noted. Thus, each strain contains 
unique sequences that can be used as markers to distinguish the strains during in vivo coinfection 
experiments. One of the regions unique to CIDMTR contains an ORF encoding a fourth putative MHC 
class I homo log not found in the 22122 sfrain. Since the three putative MHC class I homo logs found 
in 22122 appear to be important for the in vivo pathogenesis of infection [55], further functional 
comparisons of the two sfrains will be of interest. In spite of minimal cell culture passage, at least one 
mutation, in the IE1/IE2 start codon, was observed in the tissue culture-adapted CIDMTR virus, 
compared to sequences amplified from salivary gland homogenate. This observation is similar to 
reported nucleotide sequence comparisons between several open reading frames in the DNA of 
different laboratory-adapted strains and clinical isolates of HCMV that has revealed amino-terminal 
sequence extensions of ORFs with altemate start codon usage [62] . Sequence differences with respect 
to start codon usage have also been noted upon comparison of laboratory-passaged and "wild" isolates 
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of MCMV [37]. Future sequence analysis of CIDMTR DNA propagated solely in vivo is therefore 
warranted. Studies in immune competent and immune compromised guinea pigs confirm the ability of 
the CIDMTR virus to disseminate and produce viremia (DNAemia). Variation in glycoprotein protein 
coding sequences were noted, particularly for the gH and gO proteins, suggesting that this virus may 
be useful for the study of re-infection of immune guinea pigs in the guinea pig model of congenital 
cytomegalovirus infection. Cross-neutralization studies examining strain-specific antibody responses 
to envelope glycoproteins would enhance the usefulness of this new strain for the modeling of vaccine- 
mediated protection against re-infection and congenital transmission in this uniquely valuable model. 
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